Abstract

We prove variational forms of the Barban-Davenport-Halberstam Theorem and the large 
sieve inequality. We apply our result to prove an estimate for the sum of the squares of prime 
differences, averaged over arithmetic progressions. 

1 Introduction 

The prime number theorem implies the asymptotic $\psi(x) \sim x$, while the Riemann hypothesis predicts a bound of $|\psi(x) - x| \ll_\epsilon x^{1/2+\epsilon}$ on the error term. This extends naturally to arithmetic progressions, where the asymptotic $\psi(x; q, a) \sim \frac{x}{\phi(q)}$ holds for all coprime $a$ and $q$. We recall that

\[ \psi(x;q,a) := \sum_{\substack{n \le x \\ n \equiv a \bmod q}} \Lambda(n). \]

Under the Generalized Riemann hypothesis, one obtains the error bound $\left|\psi(x; q, a) - \frac{x}{\phi(q)}\right| \ll_\epsilon x^{1/2+\epsilon}$. The stronger bound of $\ll_\epsilon x^{1/2+\epsilon}\phi(q)^{-1/2}$ is also conjectured. (For further definitions, see Section 2. For background material, see [2].)

An unconditional bound on the averaged error term for this is provided by the Barban- 
Davenport-Halberstam Theorem [2], which states: 

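Theorem 1. (Barban-Davenport-Halberstam) Let $A > 0$. For all positive real numbers $x$ and $Q$ satisfying $x(\log(x))^{-A} \le Q \le x$,

\[ \sum_{q \le Q} \sum_{\substack{a \le q \\ (a,q)=1}} \left( \psi(x;q,a) - \frac{x}{\phi(q)} \right)^2 \ll_A xQ\log(x). \]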

We note that this also holds for the quantity $\theta(x; q, a)$, since the difference between $\psi(x; q, a)$ and $\theta(x; q, a)$ is of lower order.

An even stronger bound is due to Montgomery [12], refining work of Uchiyama [15], (see 
also the refinement of Hooley fB] ) : 

Theorem 2. Let $A > 0$. For all positive real numbers $x$ and $Q$ satisfying $x(\log(x))^{-A} \le Q \le x$,

\[ \sum_{q \le Q} \max_{y \le x} \sum_{\substack{a \le q \\ (a,q)=1}} \left( \psi(y;q,a) - \frac{y}{\phi(q)} \right)^2 \ll_A xQ\log(x). \]
We note that the quantity on the left has potentially increased compared to the quantity in Theorem 1, while the bound on the right is the same, up to the implicit constant. Another variant of Theorem 1 is due to Uchiyama [15]:

Theorem 3. Let $A > 0$. For all positive real numbers $x$ and $Q$ satisfying $x(\log(x))^{-A} \le Q \le x$,

\[ \sum_{q \le Q} \sum_{\substack{a \le q \\ (a,q)=1}} \max_{y \le x} \left( \psi(y;q,a) - \frac{y}{\phi(q)} \right)^2 \ll_A xQ\log^3(x). \]

This is incomparable to Theorems 1 and 2, since the quantity being bounded is larger and the bound obtained is worse. Hooley, in [7], has announced a refinement to the $\log^3(x)$ for certain values of $Q$. This seems to have not yet appeared, however.

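We work with the function $\theta$ instead of $\psi$ because it is more convenient for our purposes, though this is a minor difference. To further refine our understanding of the deviation of $\theta(x; q, a)$ from its average value of $\frac{x}{\phi(q)}$, we introduce a variational operator in place of the maximal one in Theorem 3. Letting $\{c_n\}_{n=1}^N$ be a finite sequence of complex numbers and letting $\mathcal{P}_N$ denote the set of partitions of $[N] := \{1, 2, \ldots, N\}$ into disjoint intervals, we define the $r$-variation of the sequence to be:

\[ \left\| \{c_n\}_{n=1}^N \right\|_{V^r} := \max_{\pi \in \mathcal{P}_N} \left( \sum_{I \in \pi} \Big| \sum_{n \in I} c_n \Big|^r \right)^{1/r}. \]

We can think of $\theta(x; q, a)$ as a sum over a sequence $\{b_n\}_{n=1}^N$, where $N = \lfloor x \rfloor$ and

\[ b_n := \begin{cases} \log(n), & n \text{ prime and } n \equiv a \bmod q; \\ 0, & \text{otherwise.} \end{cases} \]

For an interval $I$, we define

\[ \theta(I; q, a) := \sum_{\substack{p \in I \\ p \equiv a \bmod q}} \log(p). \]

Letting $|I|$ denote the number of integers contained in $I$, we then have that

\[ \max_{\pi \in \mathcal{P}_N} \sum_{I \in \pi} \left( \theta(I; q, a) - \frac{|I|}{\phi(q)} \right)^2 \]

is the square of the 2-variation of the sequence $\{b_n - 1/\phi(q)\}_{n=1}^N$.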
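To make the definition concrete, the following is a minimal sketch of how the $r$-variation of a finite sequence can be computed by dynamic programming over the right endpoint of the last interval. The function name `r_variation` is our own illustrative choice and does not appear in the references; the routine simply evaluates the maximum over partitions in the definition above.

```python
def r_variation(seq, r=2.0):
    """r-variation of a finite sequence: the maximum, over partitions of the
    index set into consecutive intervals, of (sum_I |sum_{n in I} c_n|^r)^(1/r)."""
    n = len(seq)
    # prefix[j] = c_1 + ... + c_j, so a block sum is a difference of prefixes
    prefix = [0]
    for c in seq:
        prefix.append(prefix[-1] + c)
    # best[j] = max of sum_I |block sum|^r over partitions of the first j terms
    best = [0.0] * (n + 1)
    for j in range(1, n + 1):
        best[j] = max(best[i] + abs(prefix[j] - prefix[i]) ** r for i in range(j))
    return best[n] ** (1.0 / r)
```

Applied to the sequence $\{b_n - 1/\phi(q)\}_{n=1}^N$ with $r = 2$, the square of the returned value is the quantity displayed above. The running time is quadratic in $N$, which is adequate for small experiments only.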

Our main result is an upper bound on this quantity, summing over $q \le Q$ and $a$ coprime to $q$ as in the above theorems. This is a strengthening of Theorem 3, since we obtain the same bound (up to the implied constant) on a larger quantity. To simplify our notation, we let $\mathcal{P}_x$ denote the set of partitions of $\{1, \ldots, \lfloor x \rfloor\}$ into disjoint intervals. We prove:

Theorem 4. Let $A > 0$. For all positive real numbers $x$ and $Q$ satisfying $x(\log(x))^{-A} \le Q \le x$,

\[ \sum_{q \le Q} \sum_{\substack{a \le q \\ (a,q)=1}} \max_{\pi \in \mathcal{P}_x} \sum_{I \in \pi} \left( \theta(I;q,a) - \frac{|I|}{\phi(q)} \right)^2 \ll_A xQ\log^3(x). \]

We also establish a variant of this, obtaining a better bound by allowing the partition to depend only on $q$ and not on $a$:

Theorem 5. Let $A > 0$. For all positive real numbers $x$ and $Q$ satisfying $x(\log(x))^{-A} \le Q \le x$,

\[ \sum_{q \le Q} \max_{\pi \in \mathcal{P}_x} \sum_{\substack{a \le q \\ (a,q)=1}} \sum_{I \in \pi} \left( \theta(I;q,a) - \frac{|I|}{\phi(q)} \right)^2 \ll_A xQ\log^2(x). \]

In comparison to Theorem 2, this maximizes over partitions instead of restricting to partial sums, but the bound obtained is worse by a multiplicative $\log(x)$ factor.

The introduction of this maximum over partitions allows us to apply our theorem to prove a weakened, averaged version of a conjecture made by Erdos. We let $p_i$ denote the $i$-th prime. Erdos made the following conjecture:



Conjecture 6. (Erdos, l^) 



\[ \sum_{p_{i+1} \le x} (p_{i+1} - p_i)^2 \ll x\log(x). \]



This asymptotic is heuristically suggested by the prime number theorem, which implies the 
reverse inequality. Assuming the Riemann Hypothesis, Selberg obtained the bound 

\[ \sum_{p_{i+1} \le x} (p_{i+1} - p_i)^2 \ll x\log^3(x). \]

It is natural to extend the conjecture to arithmetic progressions. Fixing $a, q$ such that $(a, q) = 1$, we let $p_i^{a,q}$ denote the $i$-th prime congruent to $a$ modulo $q$. One then formulates the conjecture as:

Conjecture 7. For any $a, q$ such that $(a, q) = 1$,

\[ \sum_{p_{i+1}^{a,q} \le x} \left( \frac{p_{i+1}^{a,q} - p_i^{a,q}}{\phi(q)} \right)^2 \ll \frac{x\log(x)}{\phi(q)}. \]

If we then sum over all $q \le Q$ and all $a$ coprime to $q$, we would expect to get $\ll Qx\log(x)$. We derive the following weaker bound:

Corollary 8. Let $A > 0$. For all positive real numbers $x$ and $Q$ satisfying $x(\log(x))^{-A} \le Q \le x$,

\[ \sum_{q \le Q} \sum_{\substack{a \le q \\ (a,q)=1}} \sum_{p_{i+1}^{a,q} \le x} \left( \frac{p_{i+1}^{a,q} - p_i^{a,q}}{\phi(q)} \right)^2 \ll_A xQ\log^3(x). \]

This can be viewed as an averaged, unconditional version of Selberg's bound, and is easily obtained from Theorem 4.

More generally, the study of variational quantities introduces new and interesting questions. For example, is there an elementary function $f$ such that

\[ \max_{\pi \in \mathcal{P}_x} \sum_{I \in \pi} \Big( \sum_{n \in I} (\Lambda(n) - 1) \Big)^2 \sim f(x)? \tag{1} \]

The prime number theorem gives an (asymptotic) lower bound of $x\log(x)$ on this quantity. We note, however, that one cannot hope to have $f(x) = x\log(x)$. This follows from the work of Cheer and Goldston [1], who proved

Theorem 9. For any $\epsilon > 0$, there exists an $x_0$ such that for all $x > x_0$,

\[ \sum_{p_{i+1} \le x} |p_{i+1} - p_i|^2 \ge (193/192 - \epsilon)\, x\log x. \]

(Note, as seen from Lemma 10 below, the contribution to (1) from prime powers is of lower order.) This does not rule out the possibility of $f(x) = Cx\log(x)$ for some larger $C$, for example.
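The sum appearing in Conjecture 6 and Theorem 9 is easy to examine numerically. The following is a small sketch, assuming a plain sieve of Eratosthenes; the helper names `primes_up_to` and `squared_gap_sum` are ours and purely illustrative. It only computes the sum for a given $x$ and makes no claim about its asymptotic behavior.

```python
import math

def primes_up_to(n):
    """List of primes <= n via the sieve of Eratosthenes (intended for n >= 2)."""
    sieve = bytearray([1]) * (n + 1)
    sieve[0:2] = b"\x00\x00"
    for i in range(2, math.isqrt(n) + 1):
        if sieve[i]:
            # cross out the multiples of i starting at i*i
            sieve[i * i :: i] = bytearray(len(range(i * i, n + 1, i)))
    return [i for i in range(n + 1) if sieve[i]]

def squared_gap_sum(x):
    """Sum of (p_{i+1} - p_i)^2 over consecutive primes with p_{i+1} <= x."""
    ps = primes_up_to(x)
    return sum((b - a) ** 2 for a, b in zip(ps, ps[1:]))

def normalized_gap_sum(x):
    """The same sum divided by x*log(x), the scale appearing in Conjecture 6."""
    return squared_gap_sum(x) / (x * math.log(x))
```

One can tabulate `normalized_gap_sum(x)` for increasing $x$ to compare the sum against the conjectured order $x\log(x)$.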



2 Preliminaries 

We first recall some standard definitions. When $q$ is a positive integer, $\phi(q)$ denotes the Euler totient function. For positive integers $a$ and $q$, $(a, q)$ denotes the g.c.d. of $a$ and $q$. For a positive real number $x$, we define

\[ \psi(x) = \sum_{n \le x} \Lambda(n) = \sum_{p^\alpha \le x} \log(p), \qquad \theta(x) = \sum_{p \le x} \log(p). \]

Here, $\log$ denotes the natural logarithm. The latter sum for $\psi$ is over prime powers $p^\alpha$, while the sum for $\theta$ is over primes $p$. $\Lambda(n)$ denotes the von Mangoldt function, which is equal to $\log(p)$ whenever $n$ is a power of a prime $p$ and equal to $0$ otherwise. Letting $a$ and $q$ be positive integers, we similarly define

\[ \psi(x;q,a) = \sum_{\substack{n \le x \\ n \equiv a \bmod q}} \Lambda(n), \qquad \theta(x;q,a) = \sum_{\substack{p \le x \\ p \equiv a \bmod q}} \log(p). \]

Letting $I$ be an interval, we also define

\[ \psi(I;q,a) = \sum_{\substack{n \in I \\ n \equiv a \bmod q}} \Lambda(n), \qquad \theta(I;q,a) = \sum_{\substack{p \in I \\ p \equiv a \bmod q}} \log(p). \]

The size of an interval $I$ is defined to be the number of integers it contains, and is denoted by $|I|$. For a fixed positive real number $x$, we let $\mathcal{P}_x$ denote the set of all partitions of $[1, \lfloor x \rfloor]$ into intervals. Thus an element $\pi \in \mathcal{P}_x$ is a collection of disjoint intervals whose union is the interval from $1$ to $\lfloor x \rfloor$.

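We recall the prime number theorem, which states that $\psi(x) \sim x$. We will later also use the following standard fact (we include the short proof here for completeness):

Lemma 10. For $x \ge 2$, $\theta(x) = \psi(x) + O(x^{1/2})$.

Proof. Since $p^\alpha \le x$ holds if and only if $p \le x^{1/\alpha}$, we have

\[ \psi(x) = \sum_{\alpha \ge 1} \theta\big(x^{1/\alpha}\big) \quad \text{and} \quad \psi(x) - \theta(x) = \sum_{\alpha \ge 2} \theta\big(x^{1/\alpha}\big). \]

Noting that $x^{1/\alpha} \ge 2$ only for $\alpha = O(\log x)$ and $\theta(x^{1/\alpha}) \le \psi(x^{1/\alpha}) \ll x^{1/\alpha}$, we see this is $\ll x^{1/2} + x^{1/3}\log x \ll x^{1/2}$. □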
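As a sanity check on Lemma 10, one can compute $\psi(x)$ and $\theta(x)$ directly from their definitions and compare the difference with $x^{1/2}$. The sketch below is a straightforward summation over primes and prime powers; the function name `psi_and_theta` is hypothetical, and the check is numerical illustration rather than proof.

```python
import math

def psi_and_theta(x):
    """Return (psi(x), theta(x)) for an integer x >= 2 by direct summation."""
    sieve = bytearray([1]) * (x + 1)
    sieve[0:2] = b"\x00\x00"
    for i in range(2, math.isqrt(x) + 1):
        if sieve[i]:
            sieve[i * i :: i] = bytearray(len(range(i * i, x + 1, i)))
    psi = theta = 0.0
    for p in range(2, x + 1):
        if sieve[p]:
            lp = math.log(p)
            theta += lp          # theta counts each prime once
            pk = p
            while pk <= x:       # psi counts every prime power p^alpha <= x
                psi += lp
                pk *= p
    return psi, theta
```

The difference `psi - theta` equals $\sum_{\alpha \ge 2} \theta(x^{1/\alpha})$, exactly as in the proof, so it is nonnegative and of size comparable to $x^{1/2}$.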
3 A Variational Form of the Barban-Davenport-Halberstam Theorem

We now prove:

Theorem 4. Let $A > 0$. For all positive real numbers $x$ and $Q$ satisfying $x(\log(x))^{-A} \le Q \le x$,

\[ \sum_{q \le Q} \sum_{\substack{a \le q \\ (a,q)=1}} \max_{\pi \in \mathcal{P}_x} \sum_{I \in \pi} \left( \theta(I;q,a) - \frac{|I|}{\phi(q)} \right)^2 \ll_A xQ\log^3(x). \]

We will deduce this by combining the proof of the standard Barban-Davenport-Halberstam theorem with some combinatorial arguments and a variational form of the Siegel-Walfisz Theorem that is developed in the following subsection.

For a fixed positive integer $q$, we consider Dirichlet characters modulo $q$. A function $\chi : \mathbb{Z}_q^* \to \mathbb{C}^*$ is called a Dirichlet character modulo $q$ if it is a group homomorphism. We can extend such a $\chi$ to be a function from $\mathbb{Z}$ to $\mathbb{C}$ by defining $\chi(n)$ to be equal to the value of the character on the residue class of $n$ modulo $q$ when $n$ is coprime to $q$, and $0$ otherwise. From now on, we will consider Dirichlet characters to be functions on $\mathbb{Z}$. A character $\chi$ mod $q$ is said to be primitive if its period as a function on $\mathbb{Z}$ is precisely $q$ (conversely, it is non-primitive if it has a smaller period dividing $q$).

We fix positive integers $M$ and $N$. Given a Dirichlet character $\chi$ mod $q$ and complex numbers $\{a_n\}_{n=M+1}^{M+N}$, we define

\[ T(\chi) := \sum_{n=M+1}^{M+N} a_n \chi(n). \]

More generally, for any interval $I \subset [M+1, M+N]$, we define

\[ T(I, \chi) := \sum_{n \in I} a_n \chi(n). \]

The large sieve inequality [2] states:

Theorem 11. (The Large Sieve Inequality) For any positive integers $Q, M, N$ and complex numbers $\{a_n\}_{n=M+1}^{M+N}$,

\[ \sum_{q \le Q} \frac{q}{\phi(q)} \sideset{}{^*}\sum_{\chi \bmod q} |T(\chi)|^2 \ll (N + Q^2) \sum_{n=M+1}^{M+N} |a_n|^2. \]

Here, the inner sum $\sum^*$ is over the primitive characters modulo $q$ (this is what the $*$ superscript signifies).

In our proof of Theorem 4, we will use the large sieve inequality directly as it is stated above. However, we will later establish variational versions of this in Sections 4 and 5.

3.1 A Variational Form of the Siegel-Walfisz Theorem 

For a positive real number $x$ and a Dirichlet character $\chi$ mod $q$, we define

\[ \psi(x, \chi) = \sum_{p^\alpha \le x} \chi(p^\alpha)\log(p), \quad \text{and} \quad \theta(x, \chi) = \sum_{p \le x} \chi(p)\log(p). \]

For an interval $I$, we similarly define

\[ \theta(I, \chi) = \sum_{p \in I} \chi(p)\log(p). \]

We refer to the unique $\chi$ mod $q$ that takes the value $1$ on all integers coprime to $q$ as the principal character modulo $q$, and all other characters as non-principal. The Siegel-Walfisz Theorem [2] states:

Theorem 12. (Siegel-Walfisz Theorem) Let $A$ be a positive real number. Then there exists some positive constant $c_A$ depending only on $A$ such that

\[ |\psi(x, \chi)| \ll_A x e^{-c_A \log^{1/2}(x)} \]

for all non-principal characters $\chi$ mod $q$ for all moduli $q \le \log^A(x)$.

We will find it more convenient to work with the following corollary: 

Corollary 13. Let $A$ be a positive real number. Then there exists some positive constant $c_A$ depending only on $A$ such that

\[ |\theta(x, \chi)| \ll_A x e^{-c_A \log^{1/2}(x)} \]

for all non-principal characters $\chi$ mod $q$ for all moduli $q \le \log^A(x)$.

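Proof. By the triangle inequality, $|\theta(x, \chi)| \le |\psi(x, \chi)| + |\psi(x, \chi) - \theta(x, \chi)|$. The first quantity is bounded by Theorem 12. To bound the second quantity, we observe

\[ |\psi(x, \chi) - \theta(x, \chi)| = \Big| \sum_{\substack{p^\alpha \le x \\ \alpha > 1}} \chi(p^\alpha)\log(p) \Big| \le \sum_{\substack{p^\alpha \le x \\ \alpha > 1}} \log(p) = \psi(x) - \theta(x) \]

by the triangle inequality and the fact that $|\chi(p^\alpha)|$ is always either $0$ or $1$. Applying Lemma 10, we see that $|\psi(x, \chi) - \theta(x, \chi)| \ll x^{1/2}$, where the implicit constant is independent of $q$ and $\chi$. □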

We now prove a variational form of this: 

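Lemma 14. Let $A$ be a positive real number. Then there exists some positive constant $c'_A$ depending only on $A$ such that

\[ \Big( \max_{\pi \in \mathcal{P}_x} \sum_{I \in \pi} |\theta(I, \chi)|^2 \Big)^{1/2} \ll_A x e^{-c'_A \log^{1/2}(x)} \tag{2} \]

for all non-principal characters $\chi$ mod $q$ for all moduli $q \le \log^A(x)$.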

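Proof. Since every $I \in \pi$ is a subinterval of $[1, x]$, the left hand side of (2) is

\[ \le \Big( \max_{\pi \in \mathcal{P}_x} \sum_{I \in \pi} |\theta(I, \chi)| \cdot \max_{J \subset [1,x]} |\theta(J, \chi)| \Big)^{1/2}. \tag{3} \]

We consider the inner quantity $\sum_{I \in \pi} |\theta(I, \chi)|$. By definition, we have

\[ \sum_{I \in \pi} |\theta(I, \chi)| = \sum_{I \in \pi} \Big| \sum_{p \in I} \chi(p)\log(p) \Big|. \]

Applying the triangle inequality, this is

\[ \le \sum_{I \in \pi} \sum_{p \in I} |\chi(p)\log(p)| \le \sum_{p \le x} \log(p) = \theta(x). \]

Here, we have used the fact that $|\chi(p)|$ is always either $1$ or $0$.

We then have that (3) is

\[ \le \sqrt{\theta(x)} \sqrt{\max_{J \subset [1,x]} |\theta(J, \chi)|}. \]

Since $\theta(x) \le \psi(x) \ll x$, this is

\[ \ll \sqrt{x} \sqrt{\max_{J \subset [1,x]} |\theta(J, \chi)|}. \tag{4} \]

We consider the quantity $\max_{J \subset [1,x]} |\theta(J, \chi)|$. We observe that this is $\ll \max_{y \le x} |\theta(y, \chi)|$, since each $\theta(J, \chi)$ is a difference of two such partial sums.

We will upper bound this quantity for each $y$ separately. For larger $y$ values, we will employ Corollary 13 for the value $2A$ (using $2A$ instead of $A$ will allow us to apply the corollary to a larger range of $y$ values). We let $c_{2A}$ denote the constant for $2A$ in the exponent. More precisely, for $y$ such that $q \le \log^{2A}(y)$, we have $|\theta(y, \chi)| \ll_A y e^{-c_{2A}\log^{1/2}(y)}$ by Corollary 13. Since $y \le x$, this is $\ll_A x e^{-c_{2A}\log^{1/2}(x)}$.

We now consider $y$ such that $\log^{2A}(y) < q$. This is equivalent to the condition $y < e^{q^{1/(2A)}}$. For these small $y$ values, we will use the basic estimate $|\theta(y, \chi)| \le \theta(y) \ll y$. Since $\log^A(x) \ge q$ holds by assumption, we have $\log(x) \ge q^{1/A} > \log^2(y)$. We then have $y < e^{\log^{1/2}(x)} \ll_A x e^{-c_{2A}\log^{1/2}(x)}$. Hence,

\[ \max_{y \le x} |\theta(y, \chi)| \ll_A x e^{-c_{2A}\log^{1/2}(x)}. \]

Thus, the quantity in (4) is

\[ \ll_A \sqrt{x} \sqrt{x e^{-c_{2A}\log^{1/2}(x)}} = x e^{-\frac{1}{2} c_{2A}\log^{1/2}(x)}. \]

This proves Lemma 14 with $c'_A := \frac{1}{2} \cdot c_{2A}$. □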

We note that, conditional on the generalized Riemann hypothesis, for a nonprincipal Dirichlet character of modulus $q$ one has the bound

\[ |\psi(x, \chi)| \ll x^{1/2}\log(x)\log(qx) \]

where the implied constant is absolute (see Theorem 13.7 in [Uj). This can be used as in the 
argument above to obtain: 

Lemma 15. Let $\chi$ be a nonprincipal character mod $q$. Assuming the generalized Riemann hypothesis, we have that

\[ \max_{\pi \in \mathcal{P}_x} \sum_{I \in \pi} |\theta(I, \chi)|^2 \ll x^{3/2}\log(x)\log(qx). \tag{5} \]

This could be used in place of Lemma 14 in the following arguments to conditionally extend the range of $Q$ in the statements of Theorems 4 and 5. This is quite routine, and we omit the details. It may be possible to further improve (conditionally) the exponent of the $x^{3/2}$ term. We leave this as an interesting open problem.
3.2 Proof of Theorem 4

We now bound the quantity

\[ \sum_{q \le Q} \sum_{\substack{a \le q \\ (a,q)=1}} \max_{\pi \in \mathcal{P}_x} \sum_{I \in \pi} \left( \theta(I;q,a) - \frac{|I|}{\phi(q)} \right)^2. \tag{6} \]

The structure of our proof will resemble the proof of the non-variational version of the theorem in [2].

We let $2^k$ denote the smallest power of two that is $\ge x$. We can then decompose $[1, 2^k]$ into dyadic intervals $I_{c,\ell} = ((c-1)2^\ell, c2^\ell]$, where $\ell$ ranges from $0$ to $k$ and $c$ ranges from $1$ to $2^{k-\ell}$.
We note the following lemma [§J: 

Lemma 16. Any subinterval $S \subset [1, 2^k]$ can be expressed as the disjoint union of intervals of the form $I_{c,\ell}$, such as

\[ S = \bigcup_{m} I_{c_m, \ell_m}, \]

where at most two of the intervals $I_{c_m,\ell_m}$ in the union are of each size, and where the union consists of at most $2k$ intervals.
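The decomposition in Lemma 16 can be produced greedily from the left endpoint. The following is a minimal sketch under our own conventions: the function name `dyadic_decomposition` is hypothetical, the input is an integer interval $\{lo, \ldots, hi\}$, and each block is reported as a pair $(c, \ell)$ standing for $I_{c,\ell} = ((c-1)2^\ell, c2^\ell]$. At each step it takes the largest dyadic block that is aligned at the current position and still fits inside the interval.

```python
def dyadic_decomposition(lo, hi):
    """Split the integer interval {lo, ..., hi} (1 <= lo <= hi) into dyadic
    blocks I_{c,l} = ((c-1)*2**l, c*2**l]; returns a list of (c, l) pairs."""
    blocks = []
    s = lo - 1                      # work with the half-open interval (s, hi]
    while s < hi:
        l = 0
        # grow the block while it stays aligned at s and fits inside (s, hi]
        while s % (2 ** (l + 1)) == 0 and s + 2 ** (l + 1) <= hi:
            l += 1
        blocks.append((s // 2 ** l + 1, l))
        s += 2 ** l
    return blocks
```

For example, `dyadic_decomposition(3, 10)` returns `[(2, 1), (2, 2), (5, 1)]`, that is, $(2,4] \cup (4,8] \cup (8,10]$. The levels first increase and then decrease along the output, which is why each level occurs at most twice.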

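In other words, each $I \subset [1, \lfloor x \rfloor]$ can be decomposed as a disjoint union of these dyadic intervals using at most two intervals on each level $\ell$. We let $D(I)$ denote the set of dyadic intervals in the decomposition of $I$. We observe

\[ \theta(I;q,a) - \frac{|I|}{\phi(q)} = \sum_{J \in D(I)} \left( \theta(J;q,a) - \frac{|J|}{\phi(q)} \right) \]

for any $I$, since $\sum_{J \in D(I)} |J| = |I|$. For each $\ell$, we let $D_\ell(I)$ denote the intervals in $D(I)$ on level $\ell$ (so $|D_\ell(I)| \le 2$). We can rewrite (6) as:

\[ \sum_{q \le Q} \sum_{\substack{a \le q \\ (a,q)=1}} \max_{\pi \in \mathcal{P}_x} \sum_{I \in \pi} \Big( \sum_{\ell=0}^{k} \sum_{J \in D_\ell(I)} \Big( \theta(J;q,a) - \frac{|J|}{\phi(q)} \Big) \Big)^2. \]

By the triangle inequality for the $\ell^2$ norm, we have

\[ \Big( \sum_{q \le Q} \sum_{\substack{a \le q \\ (a,q)=1}} \max_{\pi \in \mathcal{P}_x} \sum_{I \in \pi} \Big( \theta(I;q,a) - \frac{|I|}{\phi(q)} \Big)^2 \Big)^{1/2} \ll \sum_{\ell=0}^{k} \Big( \sum_{q \le Q} \sum_{\substack{a \le q \\ (a,q)=1}} \max_{\pi \in \mathcal{P}_x} \sum_{I \in \pi} \Big( \sum_{J \in D_\ell(I)} \Big( \theta(J;q,a) - \frac{|J|}{\phi(q)} \Big) \Big)^2 \Big)^{1/2}. \tag{7} \]

Since $|D_\ell(I)| \le 2$ for all $\ell, I$ and each dyadic interval can appear in $D(I)$ for at most one $I \in \pi$,

\[ \max_{\pi \in \mathcal{P}_x} \sum_{I \in \pi} \Big( \sum_{J \in D_\ell(I)} \Big( \theta(J;q,a) - \frac{|J|}{\phi(q)} \Big) \Big)^2 \ll \sum_{c=1}^{2^{k-\ell}} \Big( \theta(I_{c,\ell};q,a) - \frac{|I_{c,\ell}|}{\phi(q)} \Big)^2 \]

for all $a, q, \ell$. Therefore the quantity in (7) is

\[ \ll \sum_{\ell=0}^{k} \Big( \sum_{q \le Q} \sum_{\substack{a \le q \\ (a,q)=1}} \sum_{c=1}^{2^{k-\ell}} \Big( \theta(I_{c,\ell};q,a) - \frac{|I_{c,\ell}|}{\phi(q)} \Big)^2 \Big)^{1/2}. \tag{8} \]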
For each fixed $\ell$, we consider the quantity

\[ \sum_{q \le Q} \sum_{\substack{a \le q \\ (a,q)=1}} \sum_{c=1}^{2^{k-\ell}} \Big( \theta(I_{c,\ell};q,a) - \frac{|I_{c,\ell}|}{\phi(q)} \Big)^2. \tag{9} \]

We will bound this quantity using the large sieve inequality and the Siegel-Walfisz Theorem, so we need to first express it in terms of characters $\chi$ modulo $q$. For this, we will introduce some convenient notation. Fixing $q$, we let $\chi_0$ denote the principal character modulo $q$. For any character $\chi$ modulo $q$, we define

\[ \theta'(I, \chi) := \begin{cases} \theta(I, \chi), & \chi \ne \chi_0; \\ \theta(I, \chi_0) - |I|, & \chi = \chi_0. \end{cases} \]

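We will employ the following lemma:

Lemma 17. For any interval $I$ and any coprime positive integers $q, a$,

\[ \theta(I;q,a) - \frac{|I|}{\phi(q)} = \frac{1}{\phi(q)} \sum_{\chi \bmod q} \overline{\chi}(a)\, \theta'(I, \chi), \]

where $\overline{\chi}$ denotes the character obtained from $\chi$ by complex conjugation.

Proof. We note that for each integer $n$,

\[ \frac{1}{\phi(q)} \sum_{\chi \bmod q} \chi(n) = \begin{cases} 1, & n \equiv 1 \bmod q; \\ 0, & \text{otherwise.} \end{cases} \]

For any integer $a$ coprime to $q$, we let $\overline{a}$ denote an integer such that $a\overline{a} \equiv 1 \bmod q$. Then, for any $\chi$ mod $q$, $\overline{\chi}(a)\chi(n) = \chi(\overline{a})\chi(n) = \chi(\overline{a}n)$. Since $\overline{a}n \equiv 1 \bmod q$ if and only if $n \equiv a \bmod q$, we have

\[ \frac{1}{\phi(q)} \sum_{\chi \bmod q} \overline{\chi}(a)\chi(n) = \begin{cases} 1, & n \equiv a \bmod q; \\ 0, & \text{otherwise.} \end{cases} \]

We then observe

\[ \theta(I;q,a) = \sum_{\substack{p \in I \\ p \equiv a \bmod q}} \log(p) = \frac{1}{\phi(q)} \sum_{p \in I} \log(p) \sum_{\chi \bmod q} \overline{\chi}(a)\chi(p) = \frac{1}{\phi(q)} \sum_{\chi \bmod q} \overline{\chi}(a)\, \theta(I, \chi). \]

By definition of $\theta'(I, \chi)$, and since $\overline{\chi_0}(a) = 1$, it then follows that

\[ \theta(I;q,a) - \frac{|I|}{\phi(q)} = \frac{1}{\phi(q)} \sum_{\chi \bmod q} \overline{\chi}(a)\, \theta'(I, \chi). \]

□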
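Lemma 17 can be checked numerically in a simple special case. The sketch below is restricted to a prime modulus $q$, an assumption we make only so that the characters can be written down explicitly from a generator of the unit group; the function names are illustrative. It evaluates both sides of the identity for an integer interval $\{lo, \ldots, hi\}$.

```python
import cmath
import math

def is_prime(n):
    return n >= 2 and all(n % d for d in range(2, math.isqrt(n) + 1))

def characters_mod_prime(q):
    """All q-1 Dirichlet characters modulo a prime q, each as a dict
    {residue: value}. Index 0 is the principal character."""
    # brute-force search for a generator g of the cyclic group of units mod q
    g = next(g for g in range(1, q)
             if len({pow(g, e, q) for e in range(q - 1)}) == q - 1)
    dlog = {pow(g, e, q): e for e in range(q - 1)}   # discrete logarithms
    chars = []
    for k in range(q - 1):
        chi = {0: 0j}
        for residue, e in dlog.items():
            chi[residue] = cmath.exp(2j * cmath.pi * e * k / (q - 1))
        chars.append(chi)
    return chars

def lemma17_sides(lo, hi, q, a):
    """Both sides of Lemma 17 for the integer interval {lo, ..., hi},
    a prime modulus q, and a residue a coprime to q."""
    phi = q - 1
    size = hi - lo + 1
    primes = [p for p in range(lo, hi + 1) if is_prime(p)]
    lhs = sum(math.log(p) for p in primes if p % q == a % q) - size / phi
    rhs = 0j
    for k, chi in enumerate(characters_mod_prime(q)):
        theta_prime = sum(chi[p % q] * math.log(p) for p in primes)
        if k == 0:
            theta_prime -= size    # theta' subtracts |I| for the principal character
        rhs += chi[a % q].conjugate() * theta_prime
    return lhs, rhs / phi
```

The two returned values should agree up to floating point error, with the imaginary part of the second value negligible.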
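The quantity (9) can then be expressed as

\[ \sum_{q \le Q} \sum_{c=1}^{2^{k-\ell}} \sum_{\substack{a \le q \\ (a,q)=1}} \Big| \frac{1}{\phi(q)} \sum_{\chi \bmod q} \overline{\chi}(a)\, \theta'(I_{c,\ell}, \chi) \Big|^2. \tag{10} \]

For fixed $q$ and $c$, the inner quantity can be expanded as:

\[ \frac{1}{\phi(q)^2} \sum_{\substack{a \le q \\ (a,q)=1}} \sum_{\chi_1 \bmod q} \sum_{\chi_2 \bmod q} \overline{\chi_1}(a)\chi_2(a)\, \theta'(I_{c,\ell}, \chi_1)\, \overline{\theta'(I_{c,\ell}, \chi_2)}. \]

Reordering the sums, this is

\[ \frac{1}{\phi(q)^2} \sum_{\chi_1 \bmod q} \sum_{\chi_2 \bmod q} \theta'(I_{c,\ell}, \chi_1)\, \overline{\theta'(I_{c,\ell}, \chi_2)} \sum_{\substack{a \le q \\ (a,q)=1}} \overline{\chi_1}(a)\chi_2(a). \]

The innermost sum is now the inner product of the characters $\chi_1, \chi_2$. Since the distinct characters modulo $q$ are orthogonal under this inner product, this innermost sum is $0$ unless $\chi_1 = \chi_2$, in which case it equals $\phi(q)$. Therefore, (10) is equal to

\[ \sum_{q \le Q} \frac{1}{\phi(q)} \sum_{c=1}^{2^{k-\ell}} \sum_{\chi \bmod q} |\theta'(I_{c,\ell}, \chi)|^2. \tag{11} \]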

In order to use the large sieve inequality as stated in Theorem 11, we need to adjust our character sum to be over the primitive characters modulo $q$ instead of all characters modulo $q$. For this, we first note that every character $\chi$ modulo $q$ is induced by some primitive character $\chi_1$ modulo $q_1$, where $q_1 \le q$. We then have:

Lemma 18. For any $I$ and any character $\chi$ modulo $q$ induced by $\chi_1$ modulo $q_1$,

\[ |\theta'(I, \chi_1) - \theta'(I, \chi)| \le \log(q). \]

Proof. For all integers $n$ coprime to $q$, $\chi(n) = \chi_1(n)$. In fact,

\[ \theta'(I, \chi_1) - \theta'(I, \chi) = \sum_{\substack{p \in I \\ p \mid q}} \chi_1(p)\log(p). \]

Therefore,

\[ |\theta'(I, \chi_1) - \theta'(I, \chi)| \le \sum_{\substack{p \in I \\ p \mid q}} \log(p) \le \log(q). \]

To see the final inequality, consider the prime factorization of $q = p_1^{\alpha_1} \cdots p_r^{\alpha_r}$. Then $\log(q) = \alpha_1\log(p_1) + \cdots + \alpha_r\log(p_r)$. □

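As a consequence of Lemma 18, we have $|\theta'(I_{c,\ell}, \chi)|^2 \ll |\theta'(I_{c,\ell}, \chi_1)|^2 + \log^2(q)$ for all dyadic intervals $I_{c,\ell}$ and all non-primitive characters $\chi$. Thus, the quantity in (11) is

\[ \ll \sum_{q \le Q} \frac{1}{\phi(q)} \sum_{c=1}^{2^{k-\ell}} \sum_{\chi \bmod q} \Big( \log^2(q) + |\theta'(I_{c,\ell}, \chi_1)|^2 \Big) = \sum_{q \le Q} \frac{1}{\phi(q)} \sum_{c=1}^{2^{k-\ell}} \sum_{\chi \bmod q} \log^2(q) + \sum_{q \le Q} \frac{1}{\phi(q)} \sum_{c=1}^{2^{k-\ell}} \sum_{\chi \bmod q} |\theta'(I_{c,\ell}, \chi_1)|^2. \tag{12} \]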
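As above, $\chi_1$ here denotes the primitive character that induces $\chi$.

We bound the contribution of this first sum to the quantity in (8) (noting that there are $\phi(q)$ characters modulo $q$):

\[ \sum_{0 \le \ell \le k} \sqrt{\sum_{q \le Q} 2^{k-\ell}\log^2(q)} = \Big( \sum_{0 \le \ell \le k} 2^{(k-\ell)/2} \Big) \sqrt{\sum_{q \le Q} \log^2(q)} \ll 2^{k/2} Q^{1/2} \log(Q). \]

Since (8) is an upper bound on the square root of (6), the contribution to (6) is therefore $\ll 2^k Q\log^2(Q) \ll xQ\log^2(x)$, which is acceptable. It thus suffices to consider

\[ \sum_{q \le Q} \frac{1}{\phi(q)} \sum_{c=1}^{2^{k-\ell}} \sum_{\chi \bmod q} |\theta'(I_{c,\ell}, \chi_1)|^2 \tag{13} \]

for each fixed $\ell$. Each primitive character $\chi_1$ modulo $q_1$ induces characters $\chi$ modulo $q$ for every $q$ that is a multiple of $q_1$. We can use this to rewrite the above quantity in terms of a sum over only primitive characters:

\[ = \sum_{q \le Q} \sum_{c=1}^{2^{k-\ell}} \sideset{}{^*}\sum_{\chi \bmod q} |\theta'(I_{c,\ell}, \chi)|^2 \Big( \sum_{j \le Q/q} \frac{1}{\phi(jq)} \Big). \]

We note that

\[ \sum_{j \le Q/q} \frac{1}{\phi(jq)} \ll \phi(q)^{-1}\log(2Q/q) \]

(see [2], p. 163), so this is

\[ \ll \sum_{q \le Q} \frac{1}{\phi(q)} \log\Big(\frac{2Q}{q}\Big) \sum_{c=1}^{2^{k-\ell}} \sideset{}{^*}\sum_{\chi \bmod q} |\theta'(I_{c,\ell}, \chi)|^2. \tag{14} \]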

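We will split this sum over $q \le Q$ into ranges and bound each piece separately. For a fixed $1 \le U \le Q$, we consider

\[ \sum_{c=1}^{2^{k-\ell}} \sum_{U < q \le 2U} \frac{1}{\phi(q)} \log\Big(\frac{2Q}{q}\Big) \sideset{}{^*}\sum_{\chi \bmod q} |\theta(I_{c,\ell}, \chi)|^2. \tag{15} \]

Note that we have switched notation from $\theta'$ to $\theta$ here without changing the quantity, since $\theta'$ and $\theta$ only differ on the trivial character, and this is only included in the primitive characters modulo $q$ when $q = 1$. Since $U \ge 1$ and our sum here is over $q > U$, $\theta$ and $\theta'$ behave identically here.

We observe that for each fixed $c$, the contribution to (15) is

\[ \ll U^{-1}\log\Big(\frac{2Q}{U}\Big) \sum_{q \le 2U} \frac{q}{\phi(q)} \sideset{}{^*}\sum_{\chi \bmod q} |\theta(I_{c,\ell}, \chi)|^2. \]

Letting $a_p := \log(p)$ for primes $p$ and $a_n := 0$ for all non-primes $n$, we apply Theorem 11 to see that this is

\[ \ll U^{-1}\log\Big(\frac{2Q}{U}\Big) (2^\ell + U^2) \sum_{p \in I_{c,\ell}} \log^2(p). \]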
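We define $Q_1 := \log^{A+1}(x)$. We consider the values $U = Q2^{-j}$ as $j$ ranges from $0$ to $J := \lceil \log(Q/Q_1) \rceil$. We then have:

\[ \sum_{Q_1 < q \le Q} \frac{1}{\phi(q)} \log\Big(\frac{2Q}{q}\Big) \sideset{}{^*}\sum_{\chi \bmod q} |\theta(I_{c,\ell}, \chi)|^2 \ll \sum_{j=0}^{J} (Q2^{-j})^{-1} (j+1) \big( 2^\ell + Q^2 2^{-2j} \big) \sum_{p \in I_{c,\ell}} \log^2(p). \]

We expand the latter quantity as

\[ Q^{-1} 2^\ell \Big( \sum_{j=0}^{J} (j+1)2^j \Big) \Big( \sum_{p \in I_{c,\ell}} \log^2(p) \Big) + Q \Big( \sum_{j=0}^{J} (j+1)2^{-j} \Big) \Big( \sum_{p \in I_{c,\ell}} \log^2(p) \Big). \]

Inserting this into the sum over the $c$ values, we then have

\[ \sum_{c=1}^{2^{k-\ell}} \sum_{Q_1 < q \le Q} \frac{1}{\phi(q)} \log\Big(\frac{2Q}{q}\Big) \sideset{}{^*}\sum_{\chi \bmod q} |\theta(I_{c,\ell}, \chi)|^2 \ll Q^{-1} 2^\ell \Big( \sum_{j=0}^{J} (j+1)2^j \Big) \Big( \sum_{c=1}^{2^{k-\ell}} \sum_{p \in I_{c,\ell}} \log^2(p) \Big) + Q \Big( \sum_{j=0}^{J} (j+1)2^{-j} \Big) \Big( \sum_{c=1}^{2^{k-\ell}} \sum_{p \in I_{c,\ell}} \log^2(p) \Big). \]

We observe that

\[ \sum_{c=1}^{2^{k-\ell}} \sum_{p \in I_{c,\ell}} \log^2(p) = \sum_{p \le 2^k} \log^2(p), \]

since the union of the intervals $I_{c,\ell}$ as $c$ ranges from $1$ to $2^{k-\ell}$ is equal to the interval $[2^k]$. We then have that $\sum_{p \le 2^k} \log^2(p) \ll k2^k$. We also note that

\[ \sum_{j=0}^{J} (j+1)2^j = J2^{J+1} + 1 \ll J2^J \quad \text{and} \quad \sum_{j=0}^{J} (j+1)2^{-j} \ll 1. \]

Therefore, we have shown

\[ \sum_{c=1}^{2^{k-\ell}} \sum_{Q_1 < q \le Q} \frac{1}{\phi(q)} \log\Big(\frac{2Q}{q}\Big) \sideset{}{^*}\sum_{\chi \bmod q} |\theta(I_{c,\ell}, \chi)|^2 \ll Q^{-1}(2^\ell)(J2^J)(k2^k) + Q(k2^k). \tag{16} \]

Recalling that $2^k \ll x$, $k \ll \log(x)$, $J = \lceil \log(Q/Q_1) \rceil$, and $Q \le x$, we see that the contribution to (14) from $q$'s between $Q_1$ and $Q$ is

\[ \ll Q_1^{-1}(2^\ell)(x\log^2(x)) + Q(x\log(x)). \]

We now consider values of $q \le Q_1$. For every primitive character $\chi$ modulo $q$ where $q > 1$, $\chi$ is non-principal. Note that for the principal character $\chi_0$ modulo $1$, $\theta'(I_{c,\ell}, \chi_0) = \theta(I_{c,\ell}, \chi_0) - |I_{c,\ell}| = 0$. Thus, the contribution to (14) from values $q \le Q_1$ can be written as:

\[ \sum_{1 < q \le Q_1} \frac{1}{\phi(q)} \log\Big(\frac{2Q}{q}\Big) \sideset{}{^*}\sum_{\chi \bmod q} \sum_{c=1}^{2^{k-\ell}} |\theta(I_{c,\ell}, \chi)|^2. \tag{17} \]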
This innermost sum over $c$ is a sum over a partition of $[2^k]$, so we can apply Lemma 14 (with $A + 1$ as the constant) to conclude that

\[ \sum_{c=1}^{2^{k-\ell}} |\theta(I_{c,\ell}, \chi)|^2 \ll_A x^2 e^{-c_A\log^{1/2}(x)} \]

for some positive constant $c_A$ depending only on $A$. The quantity in (17) is then

\[ \ll_A Q_1\log(Q) \cdot x^2 e^{-c_A\log^{1/2}(x)}. \]

Putting this all together, we have that the quantity in (14) is:

\[ \ll_A Q_1\log(Q) \cdot x^2 e^{-c_A\log^{1/2}(x)} + Q_1^{-1}(2^\ell)(x\log^2(x)) + Q(x\log(x)). \]

Thus, the contribution to (8) is bounded by:

\[ \ll_A \sum_{\ell=0}^{k} \sqrt{Q_1\log(Q) \cdot x^2 e^{-c_A\log^{1/2}(x)} + Q_1^{-1}(2^\ell)(x\log^2(x)) + Q(x\log(x))} \]

\[ \ll_A \log(x)\sqrt{Q_1\log(Q) \cdot x^2 e^{-c_A\log^{1/2}(x)}} + \sqrt{Q_1^{-1}x^2\log^2(x)} + \log(x)\sqrt{Qx\log(x)}. \]

Hence the contribution to (6) is bounded by the square of this:

\[ \ll_A Q_1\log(Q)\log^2(x)\, x^2 e^{-c_A\log^{1/2}(x)} + Q_1^{-1}x^2\log^2(x) + Qx\log^3(x). \]

Recalling that $Q_1 = \log^{A+1}(x)$ and $x\log^{-A}(x) \le Q \le x$, we see that this is

\[ \ll_A \log^{A+4}(x)\, x^2 e^{-c_A\log^{1/2}(x)} + Qx\log(x) + Qx\log^3(x). \]

Since $e^{-c_A\log^{1/2}(x)}$ decays faster than any fixed power of $\log(x)$, this first term is $\ll_A Qx\log(x)$, as required. This completes the proof of Theorem 4.

3.3 An Averaged Variant of Erdos' Conjecture 

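We now apply our variational form of the Barban-Davenport-Halberstam Theorem to prove Corollary 8.

Corollary 8. Let $A > 0$. For all positive real numbers $x$ and $Q$ satisfying $x(\log(x))^{-A} \le Q \le x$,

\[ \sum_{q \le Q} \sum_{\substack{a \le q \\ (a,q)=1}} \sum_{p_{i+1}^{a,q} \le x} \left( \frac{p_{i+1}^{a,q} - p_i^{a,q}}{\phi(q)} \right)^2 \ll_A xQ\log^3(x). \]

Proof. For each fixed $a, q$, we consider a partition in $\mathcal{P}_x$ containing all the intervals of the form $I_i := (p_i^{a,q}, p_{i+1}^{a,q})$. By definition of $\theta$ and $I_i$, the quantity $\theta(I_i; q, a)$ equals $0$ for all $i$. Hence,

\[ \left( \theta(I_i;q,a) - \frac{|I_i|}{\phi(q)} \right)^2 = \left( \frac{p_{i+1}^{a,q} - p_i^{a,q} - 1}{\phi(q)} \right)^2. \]

We note that $(p_{i+1}^{a,q} - p_i^{a,q} - 1)^2 \gg (p_{i+1}^{a,q} - p_i^{a,q})^2$ (except for the case where $p_{i+1}^{a,q} = 3$ and $p_i^{a,q} = 2$, but this only occurs for $q = 1$ and so can be ignored). Thus, Theorem 4 implies

\[ \sum_{q \le Q} \sum_{\substack{a \le q \\ (a,q)=1}} \sum_{p_{i+1}^{a,q} \le x} \left( \frac{p_{i+1}^{a,q} - p_i^{a,q}}{\phi(q)} \right)^2 \ll_A xQ\log^3(x). \]

□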
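For small parameters, the left hand side of Corollary 8 can be evaluated directly. The following sketch does this by brute force; the function name `progression_gap_sum` is hypothetical, and the code is intended only as an illustration of which gaps enter the sum, not as evidence for the bound.

```python
from math import gcd, isqrt

def progression_gap_sum(x, Q):
    """Sum over q <= Q, residues a coprime to q, and consecutive primes
    p < p' <= x in the class a mod q, of ((p' - p) / phi(q))**2."""
    sieve = bytearray([1]) * (x + 1)
    sieve[0:2] = b"\x00\x00"
    for i in range(2, isqrt(x) + 1):
        if sieve[i]:
            sieve[i * i :: i] = bytearray(len(range(i * i, x + 1, i)))
    primes = [p for p in range(2, x + 1) if sieve[p]]
    total = 0.0
    for q in range(1, Q + 1):
        residues = [a for a in range(1, q + 1) if gcd(a, q) == 1]
        phi = len(residues)
        for a in residues:
            # primes in the progression a mod q, in increasing order
            ps = [p for p in primes if p % q == a % q]
            total += sum(((b - c) / phi) ** 2 for c, b in zip(ps, ps[1:]))
    return total
```

Each open interval between consecutive primes of a progression contains no prime of that progression, which is the fact used in the proof to make $\theta(I_i; q, a)$ vanish.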
4 Another Variational Form of the Barban-Davenport-Halberstam Theorem

We now prove:

Theorem 5. Let $A > 0$. For all positive real numbers $x$ and $Q$ satisfying $x(\log(x))^{-A} \le Q \le x$,

\[ \sum_{q \le Q} \max_{\pi \in \mathcal{P}_x} \sum_{\substack{a \le q \\ (a,q)=1}} \sum_{I \in \pi} \left( \theta(I;q,a) - \frac{|I|}{\phi(q)} \right)^2 \ll_A xQ\log^2(x). \]
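We will need some additional notation. We let $e(x) := e^{2\pi i x}$. For any real numbers $\{a_n\}_{n=1}^N$, we define a function $S : \mathbb{T} \to \mathbb{C}$ by

\[ S(\alpha) := \sum_{n=1}^{N} a_n e(n\alpha). \]

For $\delta > 0$, we say that points $\alpha_1, \ldots, \alpha_R \in \mathbb{T}$ are $\delta$-separated if $\|\alpha_i - \alpha_j\| \ge \delta$ for all $i \ne j$, where $\|\cdot\|$ denotes the norm modulo $1$.

We let $\mathcal{P}_N$ denote the set of all partitions of $[N]$ into a union of disjoint intervals. We then define

\[ \|S(\alpha)\|_{V^r} := \max_{\pi \in \mathcal{P}_N} \Big( \sum_{I \in \pi} \Big| \sum_{n \in I} a_n e(n\alpha) \Big|^r \Big)^{1/r}. \]

We note the variational Carleson Theorem [13]:

Theorem 19. For any real numbers $\{a_n\}_{n=1}^N$ and $r > 2$,

\[ \int_{\mathbb{T}} \|S(\alpha)\|_{V^r}^2 \, d\alpha \ll_r \sum_{n=1}^{N} |a_n|^2. \]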



The case of r = 2 is addressed in the following theorem, which follows immediately from 
Corollary 4 in [SJ: 

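Theorem 20. For any real numbers $\{a_n\}_{n=1}^N$,

\[ \int_{\mathbb{T}} \|S(\alpha)\|_{V^2}^2 \, d\alpha \ll \log(N) \sum_{n=1}^{N} |a_n|^2. \]

We note that the $\log(N)$ factor is known to be sharp. We will first prove the following lemma, which is a variational version of the analytic large sieve inequality.

Lemma 21. For any $\delta > 0$ and for any points $\alpha_1, \ldots, \alpha_R \in \mathbb{T}$ that are $\delta$-separated,

\[ \sum_{i=1}^{R} \|S(\alpha_i)\|_{V^r}^2 \ll_r (N + \delta^{-1} + 1) \sum_{n=1}^{N} |a_n|^2 \]

for any $r > 2$. Also,

\[ \sum_{i=1}^{R} \|S(\alpha_i)\|_{V^2}^2 \ll (N + \delta^{-1} + 1)\log(N) \sum_{n=1}^{N} |a_n|^2. \]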

Proof. This proof will be a variational adaptation of the proof of Theorem 5 in [lUj . By a 
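theorem of Selberg [16], there exists an entire function $K(z)$ such that $K$ is real-valued on $\mathbb{R}$, $K(x) \ge 0$ for all real $x$, and $K(x) \ge 1$ for all $1 \le x \le N$. Moreover, $K(x)$ is integrable, and $\hat{K}(0) = N - 1 + \delta^{-1}$. By a theorem of Fejer, there is another entire function $k(z)$ such that $K(x) = |k(x)|^2$ for all $x \in \mathbb{R}$, and $\hat{k}$ (the Fourier transform of $k(x)$) has support in $(-\frac{\delta}{2}, \frac{\delta}{2})$. We note that

\[ k(x) = \int_{-\delta/2}^{\delta/2} \hat{k}(\xi)\, e(x\xi)\, d\xi. \]

We define a function $T : \mathbb{T} \to \mathbb{C}$ by

\[ T(\alpha) := \sum_{n=1}^{N} a_n k(n)^{-1} e(n\alpha). \]

Similarly, we define

\[ \|T(\alpha)\|_{V^r} := \max_{\pi \in \mathcal{P}_N} \Big( \sum_{I \in \pi} \Big| \sum_{n \in I} a_n k(n)^{-1} e(n\alpha) \Big|^r \Big)^{1/r}. \]

For any $\alpha$ and any $r \ge 2$, we have

\[ \|S(\alpha)\|_{V^r} = \max_{\pi \in \mathcal{P}_N} \Big( \sum_{I \in \pi} \Big| \sum_{n \in I} a_n k(n)^{-1} k(n)\, e(n\alpha) \Big|^r \Big)^{1/r} = \max_{\pi \in \mathcal{P}_N} \Big( \sum_{I \in \pi} \Big| \int_{-\delta/2}^{\delta/2} \hat{k}(\xi) \sum_{n \in I} a_n k(n)^{-1} e(n(\alpha + \xi))\, d\xi \Big|^r \Big)^{1/r}. \]

By Minkowski's integral inequality (see [3], Theorem 201 for example), this last quantity is

\[ \le \max_{\pi \in \mathcal{P}_N} \int_{-\delta/2}^{\delta/2} |\hat{k}(\xi)| \Big( \sum_{I \in \pi} \Big| \sum_{n \in I} a_n k(n)^{-1} e(n(\alpha + \xi)) \Big|^r \Big)^{1/r} d\xi \le \int_{-\delta/2}^{\delta/2} |\hat{k}(\xi)|\, \|T(\alpha + \xi)\|_{V^r}\, d\xi. \]

Now applying the Cauchy-Schwarz inequality, we see this is

\[ \le \Big( \int_{-\delta/2}^{\delta/2} |\hat{k}(\xi)|^2 d\xi \Big)^{1/2} \Big( \int_{-\delta/2}^{\delta/2} \|T(\xi + \alpha)\|_{V^r}^2\, d\xi \Big)^{1/2}. \]

By the properties of $k$ and $K$, we have

\[ \int_{-\delta/2}^{\delta/2} |\hat{k}(\xi)|^2 d\xi = \int_{\mathbb{R}} |k(x)|^2 dx = \int_{\mathbb{R}} K(x)\, dx = \hat{K}(0) = N - 1 + \delta^{-1}. \]

Therefore, we have shown that

\[ \sum_{i=1}^{R} \|S(\alpha_i)\|_{V^r}^2 \le (N - 1 + \delta^{-1}) \sum_{i=1}^{R} \int_{-\delta/2}^{\delta/2} \|T(\xi + \alpha_i)\|_{V^r}^2\, d\xi. \]

Since the $\alpha_i$'s are $\delta$-separated, the ranges $(-\frac{\delta}{2} + \alpha_i, \frac{\delta}{2} + \alpha_i)$ are disjoint in $\mathbb{T}$, and hence

\[ \sum_{i=1}^{R} \|S(\alpha_i)\|_{V^r}^2 \le (N - 1 + \delta^{-1}) \int_{\mathbb{T}} \|T(\xi)\|_{V^r}^2\, d\xi. \tag{18} \]

For $r > 2$, we may apply Theorem 19 for the real numbers $\{a_n k(n)^{-1}\}_{n=1}^N$ to conclude that the righthand side of (18) is

\[ \ll_r (N - 1 + \delta^{-1}) \sum_{n=1}^{N} |a_n k(n)^{-1}|^2. \]

Recalling that $|a_n k(n)^{-1}|^2 = \frac{|a_n|^2}{K(n)}$ and $K(n) \ge 1$ for all $n$ from $1$ to $N$, we obtain

\[ \sum_{i=1}^{R} \|S(\alpha_i)\|_{V^r}^2 \ll_r (N - 1 + \delta^{-1}) \sum_{n=1}^{N} |a_n|^2 \]

for all $r > 2$, as required.

For $r = 2$, we may apply Theorem 20 for the real numbers $\{a_n k(n)^{-1}\}_{n=1}^N$ to conclude that the righthand side of (18) is

\[ \ll (N - 1 + \delta^{-1})\log(N) \sum_{n=1}^{N} |a_n k(n)^{-1}|^2 \le (N - 1 + \delta^{-1})\log(N) \sum_{n=1}^{N} |a_n|^2. \]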

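□

We next prove the following lemma:

Lemma 22. For any real numbers $\{a_n\}_{n=1}^N$,

\[ \sum_{q \le Q} \frac{q}{\phi(q)} \max_{\pi \in \mathcal{P}_N} \sideset{}{^*}\sum_{\chi \bmod q} \sum_{I \in \pi} |T(I, \chi)|^2 \ll (N + Q^2)\log(N) \sum_{n=1}^{N} |a_n|^2. \]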



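Proof. This proof is adapted from the proof of Theorem 4 in [2]. For a character $\chi$ modulo $q$, we define the complex value $\tau(\chi)$ by

\[ \tau(\chi) := \sum_{a=1}^{q} \chi(a)\, e\Big(\frac{a}{q}\Big). \]

We have

\[ \chi(n)\tau(\overline{\chi}) = \sum_{a=1}^{q} \overline{\chi}(a)\, e\Big(\frac{an}{q}\Big) \]

for all $n$ for all primitive $\chi$ (see [2], chapter 9). Therefore, for any interval $I$ and any primitive $\chi$ modulo $q$,

\[ T(I, \chi) = \sum_{n \in I} a_n \chi(n) = \frac{1}{\tau(\overline{\chi})} \sum_{a \le q} \overline{\chi}(a) \sum_{n \in I} a_n e\Big(\frac{an}{q}\Big). \]

We then note that when $\chi$ is primitive, $|\tau(\chi)| = q^{1/2}$ (see [2], p. 66). This yields (for any partition $\pi$):

\[ \sum_{I \in \pi} |T(I, \chi)|^2 = \frac{1}{q} \sum_{I \in \pi} \Big| \sum_{a \le q} \overline{\chi}(a) \sum_{n \in I} a_n e\Big(\frac{an}{q}\Big) \Big|^2. \]

Now summing over primitive characters, we have

\[ \sideset{}{^*}\sum_{\chi \bmod q} \sum_{I \in \pi} |T(I, \chi)|^2 = \frac{1}{q} \sum_{I \in \pi} \sideset{}{^*}\sum_{\chi \bmod q} \Big| \sum_{a \le q} \overline{\chi}(a) \sum_{n \in I} a_n e\Big(\frac{an}{q}\Big) \Big|^2. \]

This quantity can only increase if we sum over all characters modulo $q$, so this is

\[ \le \frac{1}{q} \sum_{I \in \pi} \sum_{a \le q} \sum_{b \le q} \Big( \sum_{n \in I} a_n e\Big(\frac{an}{q}\Big) \Big) \overline{\Big( \sum_{n \in I} a_n e\Big(\frac{bn}{q}\Big) \Big)} \sum_{\chi \bmod q} \overline{\chi}(a)\chi(b). \tag{19} \]

Here, the overline denotes conjugation. We note that this innermost sum over characters is equal to $0$ unless $a = b$ and $a$ is coprime to $q$. In this case, it is equal to $\phi(q)$. Plugging this into (19), we have

\[ \frac{\phi(q)}{q} \sum_{I \in \pi} \sum_{\substack{a \le q \\ (a,q)=1}} \Big| \sum_{n \in I} a_n e\Big(\frac{an}{q}\Big) \Big|^2. \]

Recalling that

\[ \|S(\alpha)\|_{V^2} := \max_{\pi \in \mathcal{P}_N} \Big( \sum_{I \in \pi} \Big| \sum_{n \in I} a_n e(n\alpha) \Big|^2 \Big)^{1/2}, \]

we conclude that

\[ \sum_{q \le Q} \frac{q}{\phi(q)} \max_{\pi \in \mathcal{P}_N} \sideset{}{^*}\sum_{\chi \bmod q} \sum_{I \in \pi} |T(I, \chi)|^2 \le \sum_{q \le Q} \max_{\pi \in \mathcal{P}_N} \sum_{I \in \pi} \sum_{\substack{a \le q \\ (a,q)=1}} \Big| \sum_{n \in I} a_n e\Big(\frac{an}{q}\Big) \Big|^2 \le \sum_{q \le Q} \sum_{\substack{a \le q \\ (a,q)=1}} \Big\| S\Big(\frac{a}{q}\Big) \Big\|_{V^2}^2. \]

Here we have used the fact that moving the maximum inside the sum over $a$'s coprime to $q$ can only make the quantity larger. The points $\frac{a}{q}$ as $q$ ranges from $1$ to $Q$ and $a$ ranges over values coprime to $q$ are $Q^{-2}$-separated as points in $\mathbb{T}$. Thus, applying Lemma 21, this is

\[ \ll (N + Q^2)\log(N) \sum_{n=1}^{N} |a_n|^2. \]

□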
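The separation property used in the last step can be verified for small $Q$ with exact rational arithmetic. The sketch below lists the points $a/q$ on the circle and measures the smallest circular gap between neighbors; the function names are hypothetical. It relies on the elementary fact that two distinct fractions $a/q \ne a'/q'$ differ by at least $1/(qq') \ge Q^{-2}$.

```python
from fractions import Fraction
from math import gcd

def farey_points(Q):
    """Sorted distinct points a/q in [0, 1) with q <= Q and (a, q) = 1,
    where a/q is read modulo 1 (so 1/1 is identified with 0)."""
    return sorted({Fraction(a, q) % 1
                   for q in range(1, Q + 1)
                   for a in range(1, q + 1) if gcd(a, q) == 1})

def min_separation(Q):
    """Smallest distance modulo 1 between two distinct points a/q, q <= Q."""
    pts = farey_points(Q)
    gaps = [b - a for a, b in zip(pts, pts[1:])]
    gaps.append(pts[0] + 1 - pts[-1])   # wrap-around distance on the circle
    return min(gaps)
```

For $Q = 5$ the minimum is attained by the neighbors $1/5$ and $1/4$, whose distance $1/20$ exceeds $Q^{-2} = 1/25$.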

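We are now equipped to prove Theorem 5. We recall Lemma 17, which states that

\[ \theta(I;q,a) - \frac{|I|}{\phi(q)} = \frac{1}{\phi(q)} \sum_{\chi \bmod q} \overline{\chi}(a)\, \theta'(I, \chi). \]

We then have

\[ \sum_{q \le Q} \max_{\pi \in \mathcal{P}_x} \sum_{\substack{a \le q \\ (a,q)=1}} \sum_{I \in \pi} \Big( \theta(I;q,a) - \frac{|I|}{\phi(q)} \Big)^2 = \sum_{q \le Q} \max_{\pi \in \mathcal{P}_x} \sum_{\substack{a \le q \\ (a,q)=1}} \sum_{I \in \pi} \Big| \frac{1}{\phi(q)} \sum_{\chi \bmod q} \overline{\chi}(a)\, \theta'(I, \chi) \Big|^2. \]

By expanding the square and rearranging the sums inside the maximum, this is

\[ \sum_{q \le Q} \frac{1}{\phi(q)^2} \max_{\pi \in \mathcal{P}_x} \sum_{I \in \pi} \sum_{\chi_1 \bmod q} \sum_{\chi_2 \bmod q} \theta'(I, \chi_1)\, \overline{\theta'(I, \chi_2)} \sum_{\substack{a \le q \\ (a,q)=1}} \overline{\chi_1}(a)\chi_2(a). \]

This innermost sum over $a$'s coprime to $q$ is equal to $\phi(q)$ whenever $\chi_1 = \chi_2$, and is equal to $0$ otherwise. Hence this quantity is

\[ \sum_{q \le Q} \frac{1}{\phi(q)} \max_{\pi \in \mathcal{P}_x} \sum_{I \in \pi} \sum_{\chi \bmod q} |\theta'(I, \chi)|^2. \tag{20} \]

Each character $\chi$ mod $q$ is induced by some primitive character $\chi_1$ mod $q_1$, where $q_1$ divides $q$. Applying the triangle inequality, we see that (20) is

\[ \ll \sum_{q \le Q} \frac{1}{\phi(q)} \max_{\pi \in \mathcal{P}_x} \sum_{I \in \pi} \sum_{\chi \bmod q} |\theta'(I, \chi_1)|^2 + \sum_{q \le Q} \frac{1}{\phi(q)} \max_{\pi \in \mathcal{P}_x} \sum_{I \in \pi} \sum_{\chi \bmod q} |\theta'(I, \chi) - \theta'(I, \chi_1)|^2. \tag{21} \]

We consider the second quantity in (21). This is

\[ \le \sum_{q \le Q} \frac{1}{\phi(q)} \Big( \max_{\pi \in \mathcal{P}_x} \sum_{I \in \pi} \sum_{\chi \bmod q} |\theta'(I, \chi) - \theta'(I, \chi_1)| \Big)^2. \]

Recalling the definitions of $\theta'(I, \chi)$ and $\theta'(I, \chi_1)$, we note that

\[ |\theta'(I, \chi) - \theta'(I, \chi_1)| \le \sum_{\substack{p \in I \\ p \mid q}} \log p. \]

Thus, the second quantity in (21) is

\[ \le \sum_{q \le Q} \frac{1}{\phi(q)} \Big( \max_{\pi \in \mathcal{P}_x} \sum_{I \in \pi} \sum_{\chi \bmod q} \sum_{\substack{p \in I \\ p \mid q}} \log(p) \Big)^2 \le \sum_{q \le Q} \phi(q) \Big( \sum_{\substack{p \le x \\ p \mid q}} \log(p) \Big)^2. \]

Since $\sum_{p \mid q} \log(p) \le \log(q)$, this is

\[ \le \sum_{q \le Q} \phi(q)\log^2(q) \ll Q^2\log^2(Q) \le xQ\log^2(x), \]

for $Q \le x$.

It now suffices to bound the first quantity in (21). Every primitive character $\chi_1$ mod $q_1$ induces characters modulo $q$ for every $q$ that is a multiple of $q_1$. Also, the set of primitive characters $\chi_1$ inducing characters modulo $q$ can be divided into primitive characters modulo each divisor of $q$. By applying the triangle inequality and maximizing separately for each divisor of $q$, we see that for each $q$:

\[ \max_{\pi \in \mathcal{P}_x} \sum_{I \in \pi} \sum_{\chi \bmod q} |\theta'(I, \chi_1)|^2 \le \sum_{q_1 \mid q} \max_{\pi \in \mathcal{P}_x} \sum_{I \in \pi} \sideset{}{^*}\sum_{\chi_1 \bmod q_1} |\theta'(I, \chi_1)|^2. \]

Now summing over $q$ and accounting for the multiple occurrences of each $q_1$, we have

\[ \sum_{q \le Q} \frac{1}{\phi(q)} \max_{\pi \in \mathcal{P}_x} \sum_{I \in \pi} \sum_{\chi \bmod q} |\theta'(I, \chi_1)|^2 \le \sum_{q \le Q} \max_{\pi \in \mathcal{P}_x} \sum_{I \in \pi} \sideset{}{^*}\sum_{\chi \bmod q} |\theta'(I, \chi)|^2 \Big( \sum_{j \le Q/q} \frac{1}{\phi(jq)} \Big). \]

As previously noted, this final sum over $j$ is $\ll \phi(q)^{-1}\log(2Q/q)$. Hence the above quantity is

\[ \ll \sum_{q \le Q} \frac{\log(2Q/q)}{\phi(q)} \max_{\pi \in \mathcal{P}_x} \sum_{I \in \pi} \sideset{}{^*}\sum_{\chi \bmod q} |\theta'(I, \chi)|^2. \]

As in the proof of Theorem 4, we break this sum over $q$ into smaller ranges. For any $1 \le U \le Q$, we have

\[ \sum_{U < q \le 2U} \frac{\log(2Q/q)}{\phi(q)} \max_{\pi \in \mathcal{P}_x} \sum_{I \in \pi} \sideset{}{^*}\sum_{\chi \bmod q} |\theta(I, \chi)|^2 \ll U^{-1}\log(2Q/U) \sum_{q \le 2U} \frac{q}{\phi(q)} \max_{\pi \in \mathcal{P}_x} \sum_{I \in \pi} \sideset{}{^*}\sum_{\chi \bmod q} |\theta(I, \chi)|^2 \tag{22} \]

(note that the change of notation from $\theta'$ to $\theta$ does not change any values). We may now apply Lemma 22 with $a_p = \log(p)$ for all primes $p$ and $a_n = 0$ otherwise. We conclude that the righthand side of (22) is

\[ \ll U^{-1}\log(2Q/U)(x + U^2)\log(x) \Big( \sum_{p \le x} \log^2(p) \Big) \ll U^{-1}\log(2Q/U)(x + U^2)\, x\log^2(x). \]

We define $Q_1 = \log^{A+1}(x)$. Setting $U = Q2^{-j}$ and summing over $j$ from $0$ to $J := \lceil \log(Q/Q_1) \rceil$, we see that

\[ \sum_{Q_1 < q \le Q} \frac{\log(2Q/q)}{\phi(q)} \max_{\pi \in \mathcal{P}_x} \sum_{I \in \pi} \sideset{}{^*}\sum_{\chi \bmod q} |\theta'(I, \chi)|^2 \ll Q^{-1} x\log^2(x) \sum_{j=0}^{J} 2^j (j+1)\big( x + Q^2 2^{-2j} \big). \]

Since $\sum_{j=0}^{\infty} (j+1)2^{-j}$ converges and $\sum_{j=0}^{J} (j+1)2^j \ll J2^J$, this quantity is $\ll Q_1^{-1}x^2\log^2(x)\log(Q) + xQ\log^2(x)$. Recalling that $Q_1 = \log^{A+1}(x)$ and $x\log^{-A}(x) \le Q \le x$, we see this is $\ll xQ\log^2(x)$ as required.

It only remains to bound the contribution from the values of $q \le Q_1$. We observe that

\[ \sum_{q \le Q_1} \frac{\log(2Q/q)}{\phi(q)} \max_{\pi \in \mathcal{P}_x} \sum_{I \in \pi} \sideset{}{^*}\sum_{\chi \bmod q} |\theta'(I, \chi)|^2 \le \log(2Q) \sum_{q \le Q_1} \frac{1}{\phi(q)} \sideset{}{^*}\sum_{\chi \bmod q} \max_{\pi \in \mathcal{P}_x} \sum_{I \in \pi} |\theta'(I, \chi)|^2. \]

Now, every primitive character modulo $q$ for $q > 1$ is non-principal. For the principal character $\chi_0$ modulo $1$, the value of $\theta'(I, \chi_0)$ is $0$ for any $I$, so we may rewrite our quantity as

\[ \log(2Q) \sum_{1 < q \le Q_1} \frac{1}{\phi(q)} \sideset{}{^*}\sum_{\chi \bmod q} \max_{\pi \in \mathcal{P}_x} \sum_{I \in \pi} |\theta(I, \chi)|^2. \]

Applying Lemma 14 for the constant $A + 1$, we see this is

\[ \ll_A \log(2Q) \sum_{1 < q \le Q_1} \frac{1}{\phi(q)} \sideset{}{^*}\sum_{\chi \bmod q} x^2 e^{-c_A\log^{1/2}(x)} \]

for some positive constant $c_A$ depending only on $A$. Since there are $\phi(q)$ characters modulo $q$ (and hence at most $\phi(q)$ primitive ones), this is $\ll_A Q_1\log(Q)\, x^2 e^{-c_A\log^{1/2}(x)}$. Recalling that $Q \le x$ and $Q_1 = \log^{A+1}(x)$ and noting that $e^{-c_A\log^{1/2}(x)} \ll_A \log^{-2A}(x)$, we see this is $\ll_A xQ\log^2(x)$. This completes the proof of Theorem 5.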
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5 A Variational Form of the Large Sieve Inequality 



We now prove another variational form of Theorem [TTJ This will refine an estimate of Uchiyama 
stated below. The techniques are similar to those used above. We let Vm,n denote the set of 
partitions of [M + 1,M + N]. 

Lemma 23. For all positive integers Q,M,N and complex numbers {an}^JJ^_^_l, 



N+M 



q<Q ^^^\modq ' leir n=M+l 



(23) 



Proof. Without loss of generality, we assume that N = 2'' for some k (rounding up to the 
nearest power of 2 can be absorbed by the implicit constant). The interval from M + 1 to M + N 
can then be decomposed into dyadic intervals of the form Ic/ := (M + (c — 1)2^, M + c2^] for 
each £ from to k, where c ranges from 1 to 2^~^. If we fix £ and let c vary, we refer to the 
resulting set of intervals as the dyadic intervals on level £. 

For a fixed $q$ and primitive character $\chi$ mod $q$, we consider a maximizing partition $\pi^*$ in
$\mathcal{P}_{M,N}$. By Lemma 16, every interval $I \in \pi^*$ can be decomposed as a union of $O(\log N)$ dyadic
intervals of the form $I_{c,\ell}$ for varying $c$ and $\ell$ values, such that there are at most 2 intervals
included on each level. We let $D(I)$ denote the set of dyadic intervals in the decomposition of
$I$. For each $\ell$, $D_{\ell}(I)$ denotes the subset of intervals in $D(I)$ on level $\ell$.
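The dyadic decomposition described above can be illustrated with a short sketch. This is a standard greedy construction, offered here as an assumption about how such a decomposition may be produced and not as the paper's own proof of the decomposition lemma; the names `dyadic_decomposition` and `max_pieces_per_level` are hypothetical. Each piece `(level, c)` stands for the interval $(M + (c-1)2^{\ell}, M + c2^{\ell}]$ with $\ell$ equal to `level`.

```python
from collections import Counter


def dyadic_decomposition(a, b, M=0):
    """Split the integer interval (a, b], with M <= a < b, into dyadic pieces.

    Each piece is returned as (level, c), standing for
    (M + (c - 1) * 2**level, M + c * 2**level].
    """
    lo, hi = a - M, b - M
    pieces = []
    while lo < hi:
        # Largest power of two dividing lo; any large power of two works when lo == 0.
        size = lo & -lo if lo > 0 else 1 << hi.bit_length()
        # Shrink the block until it fits inside (lo, hi].
        while lo + size > hi:
            size >>= 1
        level = size.bit_length() - 1
        pieces.append((level, lo // size + 1))
        lo += size
    return pieces


def max_pieces_per_level(pieces):
    """Largest number of pieces sharing a single level."""
    return max(Counter(level for level, _ in pieces).values())


if __name__ == "__main__":
    print(dyadic_decomposition(3, 11))  # [(0, 4), (2, 2), (1, 5), (0, 11)]
```

For the interval $(3, 11]$ this gives $(3,4]$, $(4,8]$, $(8,10]$, $(10,11]$: block sizes first increase and then decrease, so no level is used more than twice.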

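We can now express the square root of the left hand side of (23) as:

\[ \Bigg( \sum_{q \le Q} \frac{q}{\phi(q)} \sum_{\chi \bmod q}^{*} \sum_{I \in \pi^*} \Big| \sum_{n \in I} a_n \chi(n) \Big|^2 \Bigg)^{1/2}. \]

We can alternatively express this as:

\[ \Bigg( \sum_{q \le Q} \frac{q}{\phi(q)} \sum_{\chi \bmod q}^{*} \sum_{I \in \pi^*} \Big| \sum_{J \in D(I)} \sum_{n \in J} a_n \chi(n) \Big|^2 \Bigg)^{1/2} = \Bigg( \sum_{q \le Q} \frac{q}{\phi(q)} \sum_{\chi \bmod q}^{*} \sum_{I \in \pi^*} \Big| \sum_{\ell=0}^{k} \sum_{J \in D_{\ell}(I)} \sum_{n \in J} a_n \chi(n) \Big|^2 \Bigg)^{1/2}. \]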



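Applying the triangle inequality for the $\ell^2$ norm, this quantity is

\[ \le \sum_{\ell=0}^{k} \Bigg( \sum_{q \le Q} \frac{q}{\phi(q)} \sum_{\chi \bmod q}^{*} \sum_{I \in \pi^*} \Big| \sum_{J \in D_{\ell}(I)} \sum_{n \in J} a_n \chi(n) \Big|^2 \Bigg)^{1/2}. \tag{24} \]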



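For a fixed $\ell$, we consider the quantity

\[ \sum_{q \le Q} \frac{q}{\phi(q)} \sum_{\chi \bmod q}^{*} \sum_{I \in \pi^*} \Big| \sum_{J \in D_{\ell}(I)} \sum_{n \in J} a_n \chi(n) \Big|^2. \tag{25} \]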



We note that the innermost sum contains at most two intervals $J$, since the dyadic decomposition
of each $I$ contains at most two intervals on level $\ell$. Noting that $|a + b|^2 \ll |a|^2 + |b|^2$ holds for all
complex numbers $a$ and $b$ and that each dyadic interval on level $\ell$ can occur in the decomposition
of at most one $I \in \pi^*$, the quantity in (25) is

\[ \ll \sum_{q \le Q} \frac{q}{\phi(q)} \sum_{\chi \bmod q}^{*} \sum_{c=1}^{2^{k-\ell}} \Big| \sum_{n \in I_{c,\ell}} a_n \chi(n) \Big|^2. \]
Now the innermost sum is simply over the set of dyadic intervals on level $\ell$. Reordering the
finite sums, this is

\[ = \sum_{c=1}^{2^{k-\ell}} \sum_{q \le Q} \frac{q}{\phi(q)} \sum_{\chi \bmod q}^{*} \Big| \sum_{n \in I_{c,\ell}} a_n \chi(n) \Big|^2. \]

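For each $c$, we can apply Theorem 11 to obtain:

\[ \sum_{q \le Q} \frac{q}{\phi(q)} \sum_{\chi \bmod q}^{*} \Big| \sum_{n \in I_{c,\ell}} a_n \chi(n) \Big|^2 \ll (2^{\ell} + Q^2) \sum_{n \in I_{c,\ell}} |a_n|^2. \]

Thus, the quantity in (25) is $\ll (2^{\ell} + Q^2) \sum_{n=M+1}^{M+N} |a_n|^2$.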

Substituting this back into (24), we see that the square root of the left hand side of (23) is

\[ \ll \sum_{\ell=0}^{k} \Bigg( (2^{\ell} + Q^2) \sum_{n=M+1}^{M+N} |a_n|^2 \Bigg)^{1/2}. \]



Hence, the left hand side of (23) is

\[ \ll \Bigg( \sum_{\ell=0}^{k} \sqrt{2^{\ell} + Q^2} \Bigg)^2 \sum_{n=M+1}^{M+N} |a_n|^2. \]



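Recalling that $2^k = N$ and loosely bounding $2^{\ell} + Q^2 \le 2^k + Q^2$ for all $\ell$, we see this is

\[ \ll \Big( k \sqrt{2^k + Q^2} \Big)^2 \sum_{n=M+1}^{M+N} |a_n|^2 = k^2 (2^k + Q^2) \sum_{n=M+1}^{M+N} |a_n|^2 \ll \log^2(N) \, (N + Q^2) \sum_{n=M+1}^{M+N} |a_n|^2. \]

□

We note that this refines Uchiyama's maximal large sieve inequality [15], which states:

Lemma 24. For all positive integers $Q, M, N$ and complex numbers $\{a_n\}_{n=M+1}^{M+N}$,

\[ \sum_{q \le Q} \frac{q}{\phi(q)} \sum_{\chi \bmod q}^{*} \max_{I \subseteq [M+1, M+N]} \Big| \sum_{n \in I} a_n \chi(n) \Big|^2 \ll \log^2(N) \, (N + Q^2) \sum_{n=M+1}^{M+N} |a_n|^2. \tag{26} \]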



Montgomery [12] has asked if the $\log^2(N)$ can be removed. We do not have an answer to
this question (though we obtain a lower bound on a related quantity in [9]). We note that the
$\log^2(N)$ cannot be completely removed in our variational refinement.

To see this, we use the lemma below, which follows easily from Lemma 22 in [8] and the
Cauchy-Schwarz inequality:

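Lemma 25. Let $c_1, \ldots, c_N$ denote complex numbers with $|c_i| > \delta$, for some $\delta > 0$. Let $X_1, \ldots, X_N$
denote independent Gaussian random variables each with mean 0 and variance 1. Then

\[ \mathbb{E} \Bigg[ \sup_{\pi} \sum_{I \in \pi} \Big| \sum_{n \in I} c_n X_n \Big|^2 \Bigg] \gg \delta^2 N \log\log(N), \]

where the supremum is taken over partitions $\pi$ of $[N]$.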
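The quantity inside the expectation in Lemma 25 can be computed exactly for a given sample by dynamic programming over prefix sums. The sketch below is only an illustration under stated assumptions: it takes $c_n = 1$, treats partitions as partitions of $[N]$ into consecutive intervals, and uses hypothetical function names. It is not taken from the cited references.

```python
import random


def square_variation(values):
    """Maximum over partitions of [1..N] into intervals of sum_I |sum_{n in I} values[n]|^2."""
    prefix = [0.0]
    for v in values:
        prefix.append(prefix[-1] + v)
    # best[j] is the optimum for the first j terms; the last interval is (i, j].
    best = [0.0] * len(prefix)
    for j in range(1, len(prefix)):
        best[j] = max(best[i] + abs(prefix[j] - prefix[i]) ** 2 for i in range(j))
    return best[-1]


def mean_square_variation(N, trials, seed=0):
    """Monte Carlo average of the square variation of N standard Gaussians."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(trials):
        total += square_variation([rng.gauss(0.0, 1.0) for _ in range(N)])
    return total / trials


if __name__ == "__main__":
    for N in (16, 64, 256):
        print(N, mean_square_variation(N, trials=20) / N)
```

The dynamic program runs in $O(N^2)$ time per sample. Printing the average divided by $N$ for increasing $N$ gives a rough feel for the slow $\log\log(N)$ growth, though small simulations of this kind cannot confirm the asymptotic rate.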
Strictly speaking, the lemma in [8] is stated for real $c_1, \ldots, c_N$, but it can easily be deduced
for complex numbers by splitting into real and imaginary parts. Now we consider (23) with
interval $[N]$ and $a_n = X_n$ for each $n \in [N]$. To apply Lemma 25 for each $q$ and each character
modulo $q$, we consider only the indices $n$ such that $n$ is coprime to $q$. On these values, the
character will be nonzero (indeed, of absolute value 1). We let $C(q)$ denote the number of these indices for each $q$. Then,
from Lemma 25, the expectation of the left hand side can be estimated by

\[ \gg \sum_{q \le Q} q \cdot C(q) \cdot \log\log(C(q)). \tag{27} \]

Now, we note that $\sum_{q \le Q} C(q)$ is equal to the number of pairs $(n, q)$ such that $n$ and $q$ are
coprime. This is also equal to $\sum_{n=1}^{N} Q \frac{\phi(n)}{n}$. By Theorem 330 in [5], $\sum_{n=1}^{N} \phi(n) = \frac{3N^2}{\pi^2} + O(N \log(N))$, which implies $\sum_{n=1}^{N} \frac{\phi(n)}{n} \gg N$. Thus, $\sum_{q \le Q} C(q) \gg QN$. It follows that there
exist positive constants $\epsilon, \delta$ such that $C(q) \ge \epsilon N$ for at least $\delta Q$ values of $q$. Hence, the quantity
in (27) is $\gg Q^2 N \log\log(N)$, while the expectation of the sum of squares of the $a_n = X_n$ will be
$\ll N$. Thus, at least when $N \ll Q^2$, one needs at least a factor of $\log\log(N)$ in (23). Refining
the gap between $\log\log(N)$ and $\log^2(N)$ would be interesting.
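The counting facts used in this paragraph can be sanity-checked numerically. The following sketch uses hypothetical helper names and small parameters chosen only for illustration. It compares $\sum_{n \le N} \phi(n)$ with $3N^2/\pi^2$ and counts coprime pairs $(n, q)$ directly, so that the lower bound $\sum_{q \le Q} C(q) \gg QN$ can be observed on examples.

```python
from math import gcd, pi


def totients_up_to(N):
    """Euler's phi(n) for 0 <= n <= N, computed with a sieve."""
    phi = list(range(N + 1))
    for p in range(2, N + 1):
        if phi[p] == p:  # p has not been touched by a smaller prime, so p is prime
            for m in range(p, N + 1, p):
                phi[m] -= phi[m] // p
    return phi


def coprime_pairs(N, Q):
    """Number of pairs (n, q) with 1 <= n <= N, 1 <= q <= Q and gcd(n, q) = 1."""
    return sum(1 for n in range(1, N + 1) for q in range(1, Q + 1) if gcd(n, q) == 1)


if __name__ == "__main__":
    N = 1000
    phi = totients_up_to(N)
    print(sum(phi[1:]) / (3 * N**2 / pi**2))    # close to 1
    print(coprime_pairs(200, 100) / (200 * 100))  # roughly 6 / pi^2
```

The second printed ratio is the proportion of coprime pairs, which stays bounded away from 0, in line with $\sum_{q \le Q} C(q) \gg QN$.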